ced-9 genomic 930608 Sequence



10 20 30 40 50 60 70 80 90 100 

1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890



Clal 

SpeL Ncol E^eR7I Earl 



ATCGATAGTC GKACCAAAT GGMTTICGG ATTICTCACT AGIOCAIGGC TCACAATTTA CAAAATCICG AGAAAAGAAA GGATGCAAGG ACTATGAAGA 



Sspl Deal BstBI 

GCOTCCGAAT CTAAAmTIT TAATTTAAAA MATCAATTT CGAATTGAAA TICMCTCCT ACITOITTTG AMATGCCAA TCCTITAAGT AAACTTCTGG 



BstBI 

AICGCGCATT TCITCCAGAA ATKXTICAA AGTAGTGGfTT TICTACIGAT TTOTQOGCA AAGAATAQGA ACTTT03AAT CTGCT3GAGC GAAAOGQGAT 



Sspl 

TITSATAACA AAMACTATC CA^CAAAOC ATAGGACHT 'FICAAATMT GCTTATITGG CTGTOATIT GGAAGCAQCC MTCTTTAAC GCIGTCCAGC 



CAGAAGTGCT GCACICGCTA A3GAIAAAAG QCICATWIT GAAGCOGAAT TTTACTAAAA TCIUTAGCCA TGGAGTCGAT GCATCAGAAA TTQGAGGAAT 



titagaitk: atctigaaat tigcaatgga aaaaataatt aticaaagaa mtcacagaa aatgcaacaa maaaacaaa aaaag^acaa aaaacaagtic 



ami Sari 
Xral E^3I 

GAAAAGTGCG CCCGGGTOC?r TIGCIGACGC ASOCTKM A0GAGA0G03 CIGCTCGCGC ACITCICGTG OOCTCIGOGr GCATTTCQGC MCAAAATTC 



aacacitcttt togaaacgca ctgcxxtott tctttttka atitigataa gaaaatcagc AITCTITCAG GATGATTAAC ATTCCAACTG OGATTCIGTG 




Pvul 

Narl Clal 
KasI B^)DI 
OCGCriGQGC GCa\GAra?r 03OTTCCCG CKXTITGGA. ACKTC!GATCG TCAOCAAGGT GQQGATTTTr TGAATTTITC OGTCAAAATT GITCATTTTT 



900 



Asel Sspl 

TCICTA2GCA TGAAGGAG&A ATGTTATAACA GACAOVTTCT TTICMTTM TMTIHEAA WICftCAGT COGAGGCAAA C^OGOAATC O^AGTTCG 1000 



Bco47III 

GA.TGGGAATA OTOITCAAG CAG^CTCCA AGAA«T03CCC AATCGCTCCA CAIUTCACQG TCTACCAGCC ACAATTGACC TGGATOCICP OOGGAnCCA 1100 



TAGAATCAGC GGTTCTKTE&A TGGCXX5GAAC OCTICia?IC GGAGGAATC3 GATTGGCAGT THGCQGTPC GATTTCACGG CTTTTCTOGA TTTCATQCGP 1200 



AGCT3GAACT TAGCATGOGC G^CGGCT GICTICAAGT ACATCATTGC TTTCHXATC ATTTTOCATA CICTTAAQGG MTTQGCTTC: TTAQGATTCG 1300 



Asel Deal Asel JBglH 

ATFTGGCTAA GGGAGTCAAT MTGITOGAC AGGTAGGACjT TCAMITATT AATTTAATTG TTITAAAATA AAAATTAATT TTCAGATCTA CAAATGQGGA 1400 



mTCTCGTAT CTOGftCITIC GGCTATICTT GCICTCQCCA TIGTCTICAA CTCITGOCAG AACAAGAGCA ACAAGACIGC CTAGGCACAG ATCTIOOGOC 1500 



TICNTinC TTM033CC GCAGCGOKG ACAATICTCG TCAATTTACT TTTACOGnG ATITCTIQGA TTTICICICT TnOOGTAGA TTTAOOICIC 1600 






Earl Xtal 

CICTKCTIT TmriCIUT GTCTAGAATG TATATrATOA TTATGAAAAC GAATAAAAAT TTTAGATGAC ACGCTGCACG GCGGACAACT OXTGA03AA 1700 



T003GCGTAT OGGCGACGAA GGATQGOGAC TGGCGAGA1G AAGGAGTHC TQQGGATAAA AGGCACAGAG GQCAO0GATT TTGGAATCAA TAGTGATGCT 1800 



Mini E^rl 

CAGGACTTGC CATCACCGAG TAGGCAGGCT TCGACGCGAA GAATGTCCAT CQGAGACTCA ATTCMGGftA AAATCAATSA TTGGG&AGAG OGAAGGCTTG 1900 



AmiO^GGG ATTTCIGGTA ATITTTIAAT ' I ' lTl ' l'l'lM CT AAATAAAATT TQCIGCIGCT TOGAGGfTOGA CTATTTCACG CAO0GAATCC QGCMAACGG 2000 



TTTGGAGCAC CGGGATTGQC GK3IGGAGTC CAACQ3GAGC AGGAAA1GAT GOGA3TTATG GG?ACGATAT TO3AGAAGAA GCACGCCGAA 2100 



Bsal EVuII 
MTTTIGA3A CCTICraiGA CTAGCTCCIC GCMGCCCA GAATCItATr TKACTGTAT CAGGATGTOG TICCXjACGGT TCCMATGCA CAGACAGATC 2200 



BstBI 

AATOTOCAAT GICTTATGGA aOTIGGTAA GGGAGAAAAT ACIGAAAAAA AGFITGCAAA AATTCGAAAA TPOGCXAGAA AGGTGGCAGA AAAAACATIT 2300 



QCAAAAATTC TTIGTITICC TTCAGGAAAT CAQCAAAACT TGGTCAAAAA TAGQGCAATT ATGTC?TCITr TITGAAAGTT TTOCATTAAA AA^XACGAA 2400 




.CJL-UTU. O&UDl ULCLL 

7ITIGATCCC GGAT'TOrMT TmTTIGTT GATAAATTAG CAGAAAACTT TACGAATTCG ATTAAAAACG TTATTTTCTA TTOGAATATT TTTAAAQCAT 2500 



Eglll ncal 
AmTOOTG AOTGEATPT GQGAAAAAGA TCIGCTGATT TATCAAAAAT CGGTITITAA ATGIAAAATT TGTGGAAAAT ACATTAAAAT TCGATTITTG 2600 



BstBI EstBI 
AACTTTTOTC TTGGAAAAAC AGGITITICT GCIGATTK3C TGAAQGAAAA AQCOCAAAAA TTCAATTTTC GftACATTAAA AAQCAGAAAA ATOGTI'ITIT 2700 



Hiix3lII 

TAAGCITAAT TTIC03XAG AAATGAACGA ATTAMTTGC AAATXTCTAA THTCAGATA GGTCTAATCT CGTTCGGQGG TnOGTAGCT OCAAAMTGA 2800 



RstI Earl BairHI 

^ ▼ ^ 

TOGAATCQGT GGAACTGCAG GGACAAGTGC GAAACCICIT CGHTACACA TCGCTGTTCA TCAAAACGCG GATCCGCAAC AACTGGAAGG AACACAATOG 2900 



Sral 

GAGCTGGGTA AGGAGTATTT GCATAGACAT TAGAAGTCAA TATDXCCIT TOXTAGTAC CTHGACTFC QOGGGGrGIT GGTAAGCQGA TMTTACAGG 3000 



EVuII Berril 
GITCDGTAGC CTCITGGGGG GACAGCIGGA MCATATTCA AGTATATTAC TGTTTATGAT AATGTTATTG TTAOGGGAAT ACAAAATTCG CAGAATGCEA 3100 



Dral Deal 

TTTCACAACA TATTTGACGC GCAAAATATC CAGTAGAGAA AACTACAGIA ATPCTTTAAA TTTTTAAMT TTTTAGOTT AAAGAAAATA ACCACTAATC 3200 



Asel Iial 
AAAAGAAATT MTTICAAAA ATQGAGCOQS TAAATCGACT ACAGTAGGCA TTTAAAGAAT TACIGTAGTT TTCGCTACGA GATATTTQCG CUICAAATAT 3300 






Bsril 

GTIGK&AAT POnmChC GGATTTTTCT GTTOXXJGGA AWGCICIA AAGCATTATT TGTGAAAATA AAAAATCAAG AAAAAAATTG CAGGACGA.CT 3400 



EspHI 

^TCACACT (X5GAAAACAA ATGAAAGAGG ACTACGAACG AQCAGAAGCT GMAAAGTOG GA33C03G?\A GCAGAACAGA QGGTGGTCGA TGATTCGCGC 3500 



PvuII ^ 

TCGAGTAACA GCK3GAOTA TTOGAATCCT GIGIUK3GQC GCiATGATOIT CAGCTIGAAG TAAOGTATTC AATTKJTGTA AATAA1TAAT 3600 



TIATCTACAA CIOTTACAT TT^ATCTCA mTTGCTCA CTCMTCTCT CATQCTITGA ACTGGAAGAA GIGGGAAAOC TAGGQCACAA ATTAQ3GCIC 3700 



^ MscI 

v _ 

3 TCK3IK7TCGA TTIAaGATTT TACTGCAATT TTTTCDSATT GO LTrriTiT TTGGCCAAAC CCTAdTOGG OGTAATATCA ACmTCQGT GTIUIGTACA 3800 



Earl 

TITOTICAAA AACCCIGAAA aXTAACTIT TCICQC03FG GOCTAGQCTC GOGCTTCTCT TGCACATTTC CAAAGTAOX CTGTATCKA ATAATTCA1C 3900 



SplI 
EsiWE 

Ekrl Mlul 

_____ ________ 

TICACTHAA CTGIUICITT TCGTGTQGCn TCTTOCAACT GXXXCAAAT TOTGTACGC CTAQ3QGACT TTGTATITAT TTTTTTCAAA TTGTTTIUrC 4000 



TCTACAACAA CAAA?AAAAC QGTICTITIA TTCAACQCTT TTTTCGGAAC GMACIGCM TTTTGATAAT AGGOGTCOX AAGAGAATCC OGTITITCATr 4100 




XhoE 

PaeR7I 

Esp3I 

Earl 
▼ w 

TICGCCATCA CGTOCMCCaA AAAAGTTTAG TAGGAAAAEA TCATTTITTA AIATAATGAT TCATOTICT COXTCTICT GTCTOGAGAC GAQ3GICAAT 4200 



BstBI 

TGGATGGQCrr TGAATTTTTC GAAMCMAA A RJ1 T 1T1 GT TTAGIGTAM CGATCQCTOC GOOTATCGC TGTTTCAQCA TCAGATAGGC TCQGOCATTF 4300 



ApaLI 

GATTCCCTFG AATITIGTCG GTATATAAAA CAAAAAACGT TAGTTGCACGA TICAAAAAAC AACAAIGCGT GCITTACTAT TCACCICTGT TGTTCnTIG 4400 



Earl EfcoRI 

GCHTO3CIT TTGTTGAGGC AAAGAAGCAG ACTATCACIG TCAAGGGTAC AACTATTTGT AATAAGAAGA GAATTCAGCM GRAGGITACC TTTGGGAGAA 4500 



StUl 

AGATACTOGT GAGTPITKAG TCTIUTITAG CTIGAAACGG CTIAAAAAGG ACTAAAAAGG CCTAAAAATT GAAGTPTICC ACCIOITITC AAAAGAAAGC 4600 



GGAATTGCAC AGCITIACAC GAGATTICIC AATAATTTGT ATTTGAAAIT TTCATATTCA TOXTAAACG TTCITTACAC GAMTTTTGC GATTTTIGAG 4700 



Bsal tol 
^ w 

CITAAAATAC GATAQCTGGT OT03ACACGA AACATHTIG TFAAATTCAA AMGATGTGC GCCTTIAAAG AGIGCIUTAG TTTGAAACTT CIOTIGITGC 4800 



Clal 

BspDI ^ 
GGACTTTICA TO^TTmC GTAGCGITIT TTTATAAGAA AAATGTATTT ATFTATICAA AMTTTAATT TTAOGGAATC GOGAAAAACA AAATGAAGAA 4900 

Sad 
MscI 






Seal Bsal KasI Apal 

CACGGATAAA AATATOXAG CAACAATAGT TTGAAATTAC AGTACTCTIT TAAQGN3NKC AOOTICCTA TATTTCACAC AAACFIGTCG 5000 



aral ^Hindlll 

GQGTATCGTrC ATTTTGATO: AGAAATCAAG AAAATTGCAT ATATGTTCAA AAAACCACAA TTATGGCGAA TTTCAAGCIT GAAAQGAAAA TTCAGGAAAT 5100 



BstBI ^NSel 
TCTAAAAATT AAAAAAAAAT CATTOGAAAT GIGAAATTTG ATATTCAACT TGAAGfTOCAT ATGGCAAATT TOGTCTATIC Q3WITQGA NNATIMT 5200 



MI 

QCAQCTPGGGC GCGAAAAGAG AAAGCAGGAN NACTCATHC TGGCAATTIT TTCCTGTACC GTGICAATTA TTTGAAACTC TAATAAGCTG CTATTTITCr 5300 



Sspl 

GCTATT3ACA ACIAACT3AA TCGATAATIT GCAATTATAA TATK^dTT TGATGTICTGG CTTAGAAAAA AAAAAOCAAA AACCICATCT AGCTITAGGC 5400 



Avrll ^ a3J 
TQCCAATATA TTCCIAGGAC ATATAAAAAA GCCTIAAAAT TCICTGCAAC AOCTACAAGC TATCAAA03T ACTATTAGTA TTCAATITrC CAGIOGACCC 5500 



BarrHI 

CGA'IGACAAG CTKECCTCAA TCCAATCGAA CAAAGAAGGA GAGTICICAC TTACCGGATC OGAOGACGAG ATCACCTCAA TCIUTO2ATA CCICATMTC 5600 



AOCGACAACT GCAAQGTGAA GAAGGCCGGA TGCAAGOC7IG TTTCAGACTA TTPGATTOCA AAGGAGAAGA TGGGTGGAAC CTATGATATG ACATAGGTCA 5700 




CICHGATAT OCTTIOCGCT AAAG?OAGG AGAAGTGCEA AGAAMTGrTT TTnTTGITT QGirTGCTTG TITOGAAGGG AAGGACITTC TATCICTTTr 5800 



MTTCAACAA TAAACIATTG GAAMCOGTTT G^TTTTAA CXOTGAACIG TMGAAAAGT TGCGTCATTA TGTIGACAAT TITCCTAAOT ATATCTITGr 5900 



Bxm Sspl Asel BsmE 

GGA1ATCACA ATAAAQ3AAG TCAAAGCACG AAATATTA03 GMACACAAA ATTAATGAGA AT3CGCAACA TATTTGAOCG OAAATATCT CGTTAGGGkAA 6000 



Bco47III Sad 

CTACAGTAAT TCITCAAAAG ACIACTCTAG CGCIGKjIOG ATTTACGAGC TCGATFHTG AAATGAATCA GACTAGAAGA AAAGGAGGA^ AATATTGAAC 6100 



Hill Bbsl 
ATCAATTC&A CATCAATICA AAAAGIOGAA OXTIGACm CAGTAC7ICIT CTAAAGAATT ACrGTASTTT TO3CTACGAG ATATTTTGNG NGTCAAATAT 6200 



GiTO^AAT ACQCATCCIC AGAATTGTGT GITCICGTAA TGIUITGAM ATTTTQCATT TCAACATCAA ATAAGCAMT CTAAAAATGT GGGTTCTGCA 6300 



PstI 

GGGACCACTA TGACIGTGAT OTIGGCAAGA GCCACTCAGA AAACTAOGTC TTCCTT^AA CAAATACATT TITMGrFATT GTAGGTATAA AAATT3ITGG 6400 



M Sail ^Bbsl ^inSCII 

CTAGCAGICT AGGCIGCCTT TTTCACTCGA C2AACTICIA ATTTAAKGG CGGGTCTKA AAAAGrTCGIT TCTITGAAAA TATAAAGCTT TATATATTTA 6500 



BooK/ ^el 

TATATIAAAA ATFITGA'FIA CATGATATCA AAAGGGACTA GtfTTGTATAA AAATTATCAA 6560 



Fig. 2



10 20 30 40 50 60 70 80 90 100 
1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 1234567890 

TTTGAGATGA CACGCTGCAC GGCGGACAAC TCGCTGACGA ATCCGGCGTA TCGGCGACGA ACGATGGCGA CTGGCGAGAT GAAGGAGTTT CTGGGGATAA 100 
MetT hrArgCysTh rAlaAspAsn SerLeuThrA snProAlaTy rArgArgArg ThrMetAlaT hrGlyGluMe tLysGluPhe LeuGlyIleL

AAGGCACAGA GCCCACCGAT TTTGGAATCA ATAGTGATGC TCAGGACTTG CCATCACCGA GTAGGCAGGC TTCGACGCGA AGAATGTCCA TCGGAGAGTC 200
ysGlyThrGl uProThrAsp PheGlyIleA snSerAspAl aGlnAspLeu ProSerProS erArgGlnAl aSerThrArg ArgMetSerI leGlyGluSe

EcoRV SalI

AATTGATGGA AAAATCAATG ATTGGGAAGA GCCAAGGCTT GATATCGAGG GATTTGTGGT CGACTATTTC ACGCACCGAA TCCGGCAAAA CGGAATGGAA 300
rIleAspGly LysIleAsnA spTrpGluGl uProArgLeu AspIleGluG lyPheValVa lAspTyrPhe ThrHisArgI leArgGlnAs nGlyMetGlu

TGGTTTGGAG CACCGGGATT GCCGTGTGGA GTGCAACCGG AGCACGAAAT GATGCGAGTT ATGGGAACGA TATTCGAGAA GAAGCACGCG GAAAATTTTG 400
TrpPheGlyA laProGlyLe uProCysGly ValGlnProG luHisGluMe tMetArgVal MetGlyThrI lePheGluLy sLysHisAla GluAsnPheG

PvuII

AGACCTTCTG TGAGCAGCTG CTCGCAGTGC CCAGAATCTC ATTTTCACTG TATCAGGATG TGGTTCGGAC GGTTGGAAAT GCACAGACAG ATCAATGTCC 500
luThrPheCy sGluGlnLeu LeuAlaValP roArgIleSe rPheSerLeu TyrGlnAspV alValArgTh rValGlyAsn AlaGlnThrA spGlnCysPr

PstI

▼

AATGTCTTAT GGACGTTTGA TAGGTCTAAT CTCGTTCGGC GGTTTCGTAG CTGCAAAAAT GATGGAATCC GTGGAACTGC AGGGACAAGT GCGAAACCTC 600
oMetSerTyr GlyArgLeuI leGlyLeuIl eSerPheGly GlyPheValA laAlaLysMe tMetGluSer ValGluLeuG lnGlyGlnVa lArgAsnLeu

BamHI BspHI
▼ ▼

TTCGTTTACA CATCGCTGTT CATCAAAACG CGGATCCGCA ACAACTGGAA GGAACACAAT CGGAGCTGGG ACGACTTCAT GACACTCGGA AAACAAATGA 700
PheValTyrT hrSerLeuPh eIleLysThr ArgIleArgA snAsnTrpLy sGluHisAsn ArgSerTrpA spAspPheMe tThrLeuGly LysGlnMetL

PvuII

▼

AAGAGGACTA CGAACGAGCA GAAGCTGAAA AAGTGGGACG CCGGAAGCAG AACAGACGGT GGTCGATGAT TGGCGCTGGA GTAACAGCTG GAGCCATTGG 800
ysGluAspTy rGluArgAla GluAlaGluL ysValGlyAr gArgLysGln AsnArgArgT rpSerMetIl eGlyAlaGly ValThrAlaG lyAlaIleGl

AseI

▼

AATCGTTGGA GTCGTCGTGT GTGGGCGGAT GATGTTCAGC TTGAAGTAAC GTATTCAATT TGTGTAAATA ATTAATTTAT GTACAACTCC TTACATTTGA 900
yIleValGly ValValValC ysGlyArgMe tMetPheSer LeuLys...

ATCTCATTTT KGCTCACTGA TTCTCTCATC CTTTGAACTG GAAGAAGTGG GAAAGCTAGG CCACAAATTA CGGCTCTCTG TGTCGATTTA CGATTTTACT 1000

BalI

▼

GCAATTTTTT CCGATTGCCT TTTTTTTTGG CCAAACCCTA CTTCCGCGTA ATATCAACTT TTCCGTGTTC TGTACATTTC GTCAAAAACC CTGAAACCCT 1100



AACTTTTCTC GCCGTGGCCT AGCCTCCCGC TTCTCTTCCA CATTTCCAAA GTACCCCTGT ATCTCAATAA TTCATCTTCA CTTTAACTGT CTCTTTTCGT 1200

SplI
MluI

GTGGCCTCTT CCAACTCCCC CCAAATTCCT GTACGCGTAC GCGACTTTGT ATTTATTTTT TTCAAATTGT TTTCTCTCTA CAACAACAAA AAAAACGGTT 1300



CAAAAAAAAA AAAAA 1315 

Fig. 3 
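
The reading frame annotated in Fig. 3 can be checked mechanically. The following is a minimal Python sketch, assuming the standard genetic code. The function name `translate` and the 30-nucleotide excerpt (beginning at the ATG at position 7 of the first row) are illustrative choices and are not part of the original figure.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the open reading frame, copied from row 1 of Fig. 3.
orf_start = "ATGACACGCTGCACGGCGGACAACTCGCTG"
print(translate(orf_start))
```

Under these assumptions the excerpt translates to MTRCTADNSL, the first ten residues listed in Fig. 4.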



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

MTRCTADNSL TNPAYRRRTM ATGEMKEFLG IKGTEPTDFG INSDAQDLPS 50 

PSRQASTRRM SIGESIDGKI NDWEEPRLDI EGFVVDYFTH RIRQNGMEWF 100

GAPGLPCGVQ PEHEMMRVMG TIFEKKHAEN FETFCEQLLA VPRISFSLYQ 150

DVVRTVGNAQ TDQCPMSYGR LIGLISFGGF VAAKMMESVE LQGQVRNLFV 200

YTSLFIKTRI RNNWKEHNRS WDDFMTLGKQ MKEDYERAEA EKVGRRKQNR 250

RWSMIGAGVT AGAIGIVGVV VCGRMMFSLK 280



Fig. 4 



MTRCTADNSL TNPAYRRRTM ATGEMKEFLG IKGTEPTDFG INSDAQDLPS 50 



PSRQASTRRM SIGESIDGKI NDWEEPRLDI EGFVVDYFTH RIRQNGMEWF 100

n1653 ts (TAT -> AAT) N

t 

GAPGLPCGVQ PEHEMMRVMG TIFEKKHAEN FETFCEQLLA VPRISFSLYQ 150 

E (GGA -> GAA) n1950

t 

DVVRTVGNAQ TDQCPMSYGR LIGLISFGGF VAAKMMESVE LQGQVRNLFV 200

I 

STOP (CAG -> TAG) n2077 
YTSLFIKTRI RNNWKEHNRS WDDFMTLGKQ MKEDYERAEA EKVGRRKQNR 250 



RWSMIGAGVT AGAIGIVGVV VCGRMMFSLK



Fig. 5 
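
The codon changes annotated in Fig. 5 can be read against the standard genetic code. The sketch below is a hedged illustration: the `MUTATIONS` dictionary and the `describe` helper are hypothetical names, and the allele labels and codons are taken as printed in the figure.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Allele -> (wild-type codon, mutant codon), as annotated in Fig. 5.
MUTATIONS = {
    "n1653": ("TAT", "AAT"),
    "n1950": ("GGA", "GAA"),
    "n2077": ("CAG", "TAG"),
}


def describe(wild, mutant):
    """Return the one-letter residues encoded before and after the change."""
    return CODON_TABLE[wild], CODON_TABLE[mutant]


for allele, (wild, mutant) in MUTATIONS.items():
    before, after = describe(wild, mutant)
    print(f"{allele}: {wild} -> {mutant} encodes {before} -> {after}")
```

With the standard code, TAT to AAT changes Tyr to Asn, GGA to GAA changes Gly to Glu, and CAG to TAG creates a stop codon (shown as `*`), in agreement with the N, E and STOP labels in the figure.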



Gap Weight: 3.000 Average Match: 0.540 

Length Weight: 0.100 Average Mismatch: -0.396 

Quality: 89.8 Length: 298 

Ratio: 0.376 Gaps: 11 

Percent Similarity: 47.059 Percent Identity: 23.077 



1 ...MTRCTADN SLTNPAYRRRTMATGEMKEFLGIKGTEPT 37 

. | . - || . | . . . : | : . . | : : . . : • • | • 

1 MAHAGRTGYDNREIVMKYIHYKLSQRGYEW...DAGDVGAAPPGAAPAPG 47

38 DFGINSDAQDLPSPSRQASTRRMSIGESIDGKINDWEEPRLDIEGFVVDY 87

| : . . : . | . : | | : : . | . : 

48 IFSSQPGHTPHPAASRDPVARTSPLQTPAAPGAA 81 

88 FTHRIRQNGMEWFGAPGLPCGVQPEHEMMRVMGTIFEKKHAENFETFCEQ 137

::|:| I H I- I — - 

82 AGPALSPVPPVVHLALRQAGDDFSRRYRGDFAEMSSQ 118



m • - 

138 LLAVPRISFSLYQDVVRTVGNAQTDQCPMSYGRLIGLISFGGFVAAKMME 187

I . I . . : : . . I I • : • • • | | :::::.[ | | . : | 

119 LHLTPFTARGRFATVVEELFRD GVNWGRIVAFFEFGGVMC...VE 160

conserved residue mutated in n1950

188 SV..ELQGQVRNLFVYTSLFIKTRIRNNWKEHNRSWDDFMTL.GKQMKE. 233

|| | : . | | : : : . : : - — s • I : • I : M • I : • I I • I : • 
161 SVNREMSPLVDNIALWMTEYLNRHL.HTWIQDNGGWDAFVELYGPSMRPL 209

• • • 

234 .DYERAEAEKVGRRKQNRRWSMIGAGVTAGAIGIVGVVVCGRMMFSLK 280

| : . : . : : | | • : I I I • : I I 

210 FDFSWLSLKTL LSLALVGACITLGAY LSHK 239 



Fig. 6 
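
The "Percent Identity" statistic in Fig. 6 is a count over aligned columns. The sketch below shows one common way to compute such a figure. It is an assumption rather than the documented behaviour of the alignment program: columns in which either sequence has a gap are skipped, and identity is reported over the remaining columns. The function name `percent_identity` is hypothetical, and the ten-column excerpt is taken from the start of rows 188 and 161 of the alignment, with spaces inside the gap run removed.

```python
def percent_identity(seq_a, seq_b, gap="."):
    """Percent of gap-free aligned columns in which both residues match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(seq_a, seq_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns that contain a gap
        compared += 1
        if res_a == res_b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# First ten columns of the rows beginning at residue 188 (top) and 161 (bottom).
top_row = "SV..ELQGQV"
bottom_row = "SVNREMSPLV"
print(percent_identity(top_row, bottom_row))
```

For this short excerpt, four of the eight gap-free columns match, so the sketch reports 50.0. The whole-alignment value in the figure (23.077) depends on the program's own treatment of gaps and end regions.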
Fig. 7-1



501 GTCCCCGGCT CCTTCATCGT CCCATCTCCC CTGTCTCTCT CCTGGGGAGG 
551 CGTGAAGCGG TCCCGTGGAT AGAGATTCAT GCCTGTGTCC GCGCGTGTGT 
601 GCGCGCGTAT AAATTGCCGA GAAGGGGAAA ACATCACAGG ACTTCTGCGA 
651 ATACCGGACT GAAAATTGTA ATTCATCTGC CGCCGCCGCT GCCAAAAAAA 
701 AACTCGAGCT CTTGAGATCT CCGGTTGGGA TTCCTGCGGA TTGACATTTC 
751 TGTGAAGCAG AAGTCTGGGA ATCGATCTGG AAATCCTCCT AATTTTTACT 
801 CCCTCTCCCC CCGACTCCTG ATTCATTGGG AAGTTTCAAA TCAGCTATAA 
851 CTGGAGAGTG CTGAAGATTG ATGGGATCGT TGCCTTATGC ATTTGTTTTG 
901 GTTTTACAAA AAGGAAACTT GACAGAGGAT CATGCTGTAC TTAAAAAATA 
951 CAAGTAAGTC TCGCACAGGA AATTGGTTTA ATGTAACTTT CAATGGAAAC 
1001 CTTTGAGATT TTTTACTTAA AGTGCATTCG AGTAAATTTA ATTTCCAGGC 
1051 AGCTTAATAC ATTGTTTTTA GCCGTGTTAC TTGTAGTGTG TATGCCCTGC 
1101 TTTCACTCAG TGTGTACAGG GAAACGCACC TGATTTTTTA CTTATTAGTT 
1151 TGTTTTTTCT TTAACCTTTC AGCATCACAG AGGAAGTAGA CTGATATTAA 

1201 CAATACTTAC TAATAATAAC GTGCCTCATG AAATAAAGAT CCGAAAGGAA
1251 TTGGAATAAA AATTTCCTGC GTCTCATGCC AAGAGGGAAA CACCAGAATC 

1301 AAGTGTTCCG CGTGATTGAA GACACCCCCT CGTCCAAGAA TGCAAAGCAC
1351 ATCCAATAAA ATAGCTGGAT TATAACTCCT CTTCTTTCTC TGGGGGCCGT
1401 GGGGTGGGAG CTGGGGCGAG AGGTGCCGTT GGCCCCCGTT GCTTTTCCTC 
1451 TGGGAAGGAT GGCGCACGCT GGGAGAACGG GGTACGACAA CCGGGAGATA 
1501 GTGATGAAGT ACATCCATTA TAAGCTGTCG CAGAGGGGCT ACGAGTGGGA 
1551 TGCGGGAGAT GTGGGCGCCG CGCCCCCGGG GGCCGCCCCC GCACCGGGCA 
1601 TCTTCTCCTC CCAGCCCGGG CACACGCCCC ATCCAGCCGC ATCCCGCGAC 
1651 CCGGTCGCCA GGACCTCGCC GCTGCAGACC CCGGCTGCCC CCGGCGCCGC 
1701 CGCGGGGCCT GCGCTCAGCC CGGTGCCACC TGTGGTCCAC CTGGCCCTCC 
1751 GCCAAGCCGG CGACGACTTC TCCCGCCGCT ACCGCGGCGA CTTCGCCGAG
1801 ATGTCCAGCC AGCTGCACCT GACGCCCTTC ACCGCGCGGG GACGCTTTGC 
1851 CACGGTGGTG GAGGAGCTCT TCAGGGACGG GGTGAACTGG GGGAGGATTG 
1901 TGGCCTTCTT TGAGTTCGGT GGGGTCATGT GTGTGGAGAG CGTCAACCGG 




1951 GAGATGTCGC CCCTGGTGGA CAACATCGCC CTGTGGATGA CTGAGTACCT
2001 GAACCGGCAC CTGCACACCT GGATCCAGGA TAACGGAGGC TGGGATGCCT
2051 TTGTGGAACT GTACGGCCCC AGCATGCGGC CTCTGTTTGA TTTCTCCTGG
2101 CTGTCTCTGA AGACTCTGCT CAGTTTGGCC CTGGTGGGAG CTTGCATCAC
2151 CCTGGGTGCC TATCTGAGCC ACAAGTGAAG TCAACATGCC TGCCCCAAAC
2201 AAATATGCAA AAGGTTCACT AAAGCAGTAG AAATAATATG CATTGTCAGT
2251 GATGTACCAT GAAACAAAGC TGCAGGCTGT TTAAGAAAAA ATAACACACA
2301 TATAAACATC ACACACACAG ACAGACACAC ACACACACAA CAATTAACAG
2351 TCTTCAGGCA AAACGTCGAA TCAGCTATTT ACTGCCAAAG GGAAATATCA
2401 TTTATTTTTT ACATTATTAA GAAAAAAGAT TTATTTATTT AAGACAGTCC
2451 CATCAAAACT CCGTCTTTGG AAATCCGACC ACTAATTGCC AAACACCGCT
2501 TCGTGTGGCT CCACCTGGAT GTTCTGTGCC TGTAAACATA GATTCGCTTT
2551 CCATGTTGTT GGCCGGATCA CCATCTGAAG AGCAGACGGA TGGAAAAAGG
2601 ACCTGATCAT TGGGGAAGCT GGCTTTCTGG CTGCTGGAGG CTGGGGAGAA
2651 GGTGTTCATT CACTTGCATT TCTTTGCCCT GGGGGCGTGA TATTAACAGA
2701 GGGAGGGTTC CCGTGGGGGG AAGTCCATGC CTCCCTGGCC TGAAGAAGAG
2751 ACTCTTTGCA TATGACTCAC ATGATGCATA CCTGGTGGGA GGAAAAGAGT
2801 TGGGAACTTC AGATGGACCT AGTACCCACT GAGATTTCCA CGCCGAAGGA
2851 CAGCGATGGG AAAAATGCCC TTAAATCATA GGAAAGTATT TTTTTAAGCT
2901 ACCAATTGTG CCGAGAAAAG CATTTTAGCA ATTTATACAA TATCATCCAG
2951 TACCTTAAAC CCTGATTGTG TATATTCATA TATTTTGGAT ACGCACCCCC
3001 CAACTCCCAA TACTGGCTCT GTCTGAGTAA GAAACAGAAT CCTCTGGAAC
3051 TTGAGGAAGT GAACATTTCG GTGACTTCCG ATCAGGAAGG CTAGAGTTAC
3101 CCAGAGCATC AGGCCGCCAC AAGTGCCTGC TTTTAGGAGA CCGAAGTCCG
3151 CAGAACCTAC CTGTGTCCCA GCTTGGAGGC CTGGTCCTGG AACTGAGCCG
3201 GGCCCTCACT GGCCTCCTCC AGGGATGATC AACAGGGTAG TGTGGTCTCC
3251 GAATGTCTGG AAGCTGATGG ATGGAGCTCA GAATTCCACT GTCAAGAAAG
3301 AGCAGTAGAG GGGTGTGGCT GGGCCTGTCA CCCTGGGGCC CTCCAGGTAG
3351 GCCCGTTTTC ACGTGGAGCA TAGGAGCCAC GACCCTTCTT AAGACATGTA
3401 TCACTGTAGA GGGAAGGAAC AGAGGCCCTG GGCCTTCCTA TCAGAAGGAC
3451 ATGGTGAAGG CTGGGAACGT GAGGAGAGGC AATGGCCACG GCCCATTTTG
3501 GCTGTAGCAC ATGGCACGTT GGCTGTGTGG CCTTGGCCAC CTGTGAGTTT
3551 AAAGCAAGGC TTTAAATGAC TTTGGAGAGG GTCACAAATC CTAAAAGAAG
3601 CATTGAAGTG AGGTGTCATG GATTAATTGA CCCCTGTCTA TGGAATTACA
3651 TGTAAAACAT TATCTTGTCA CTGTAGTTTG GTTTTATTTG AAAACCTGAC
3701 AAAAAAAAAG TTCCAGGTGT GGAATATGGG GGTTATCTGT ACATCCTGGG
3751 GCATTAAAAA AAAATCAATG GTGGGGAACT ATAAAGAAGT AACAAAAGAA
3801 GTGACATCTT CAGCAAATAA ACTAGGAAAT TTTTTTTTCT TCCAGTTTAG
3851 AATCAGCCTT GAAACATTGA TGGAATAACT CTGTGGCATT ATTGCATTAT
3901 ATACCATTTA TCTGTATTAA CTTTGGAATG TACTCTGTTC AATGTTTAAT
3951 GCTGTGGTTG ATATTTCGAA AGCTGCTTTA AAAAAATACA TGCATCTCAG
4001 CGTTTTTTTG TTTTTAATTG TATTTAGTTA TGGCCTATAC ACTATTTGTG
4051 AGCAAAGGTG ATCGTTTTCT GTTTGAGATT TTTATCTCTT GATTCTTCAA
4101 AAGCATTCTG AGAAGGTGAG ATAAGCCCTG AGTCTCAGCT ACCTAAGAAA
4151 AACCTGGATG TCACTGGCCA CTGAGGAGCT TTGTTTCAAC CAAGTCATGT
4201 GCATTTCCAC GTCAACAGAA TTGTTTATTG TGACAGTTAT ATCTGTTGTC
4251 CCTTTGACCT TGTTTCTTGA AGGTTTCCTC GTCCCTGGGC AATTCCGCAT
4301 TTAATTCATG GTATTCAGGA TTACATGCAT GTTTGGTTAA ACCCATGAGA
4351 TTCATTCAGT TAAAAATCCA GATGGCGAAT GACCAGCAGA TTCAAATCTA
4401 TGGTGGTTTG ACCTTTAGAG AGTTGCTTTA CGTGGCCTGT TTCAACACAG
4451 ACCCACCCAG AGCCCTCCTG CCCTCCTTCC GCGGGGGCTT TCTCATGGCT
4501 GTCCTTCAGG GTCTTCCTGA AATGCAGTGG TCGTTACGCT CCACCAAGAA
4551 AGCAGGAAAC CTGTGGTATG AAGCCAGACC TCCCCGGCGG GCCTCAGGGA
4601 ACAGAATGAT CAGACCTTTG AATGATTCTA ATTTTTAAGC AAAATATTAT
4651 TTTATGAAAG GTTTACATTG TCAAAGTGAT GAATATGGAA TATCCAATCC
4701 TGTGCTGCTA TCCTGCCAAA ATCATTTTAA TGGAGTCAGT TTGCAGTATG
4751 CTCCACGTGG TAAGATCCTC CAAGCTGCTT TAGAAGTAAC AATGAAGAAC
4801 GTGGACGTTT TTAATATAAA GCCTGTTTTG TCTTTTGTTG TTGTTCAAAC

Fig. 7-4 



4851 GGGATTCACA GAGTATTTGA AAAATGTATA TATATTAAGA GGTCACGGGG 
4901 GCTAATTGCT AGCTGGCTGC CTTTTGCTGT GGGGTTTTGT TACCTGGTTT 
4951 TAATAACAGT AAATGTGCCC AGCCTCTTGG CCCCAGAACT GTACAGTATT 
5001 GTGGCTGCAC TTGCTCTAAG AGTAGTTGAT GTTGCATTTT CCTTATTGTT 
5051 AAAAACATGT TAGAAGCAAT GAATGTATAT AAAAGC 